Feature Ranking Using Linear SVM
نویسندگان
چکیده
Feature ranking is useful to gain knowledge of data and identify relevant features. This article explores the performance of combining linear support vector machines with various feature ranking methods, and reports the experiments conducted when participating the Causality Challenge. Experiments show that a feature ranking using weights from linear SVM models yields good performances, even when the training and testing data are not identically distributed. Checking the difference of Area Under Curve (AUC) with and without removing each feature also gives similar rankings. Our study indicates that linear SVMs with simple feature rankings are effective on data sets in the Causality Challenge.
منابع مشابه
An Efficient Method for Variables Selection Using SVM-Based Criteria
The problem of feature selection for Support Vector Machines (SVMs) classification is investigated in the linear two classes case. We suggest a new method of feature selection based on ranking scores derived from SVMs. We analyze the retraining effects on the ranking rules based on these scores. Our features selection algorithm consists in a forward selection strategy according to the decreasin...
متن کاملOptimizing Area Under the ROC Curve using Ranking SVMs
Area Under the ROC Curve (AUC), often used for comparing classifiers, is a widely accepted performance measure for ranking instances. Many researches have studied optimization of AUC, usually via optimizing some approximation of a ranking function. Ranking SVMs are among the better performers but their usage in the literature is typically limited to learning a total ranking from partial ranking...
متن کاملMovie Recommendations Using Social Networks
This paper explores utilization of information from social networks in making automatic movie recommendations. Implementations of three different algorithms (SVM, Clustering, and Ranking SVM) are implemented and evaluated. The general approach utilizes a large collection of Facebook profile information as training set in order to generate a list of movie recommendations for a particular user (c...
متن کاملImproving Classification Accuracy via Contextual Feature Ranking in High Spatial Resolution Satellite Imagery
Texture quantization is a useful method for extraction spatial relevance between pixels which is used in humane brain for image interpreting. Beside the spectral bands textural features of high spatial resolution data can be used to improve classification accuracy. Depends on the land cover characteristics different textural features possibly are effective from large number of available textura...
متن کاملAn Empirical Study of Software Metrics Selection Using Support Vector Machine
The objective of feature selection is to identify irrelevant and redundant features, which can then be discarded from the analysis. Reducing the number of metrics (features) in a data set can lead to faster software quality model training and improved classifier performance. In this study we focus on feature ranking using linear Support Vector Machines (SVM) which is implemented in WEKA. The co...
متن کامل